The discovery of new G-protein coupled receptors, and the polynucleotides encoding them,
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention,
and treatment of cell proliferative, neurological, cardiovascular, gastrointestinal,
autoimmune/inflammatory, and metabolic disorders, and viral infections, and in the assessment of the
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of
G-protein coupled receptors.



SUMMARY OF THE INVENTION

The invention features purified polypeptides, G-protein coupled receptors, referred to
collectively as "GCREC" and individually as "GCREC-1," "GCREC-2," "GCREC-3," "GCREC-4,"
"GCREC-5," "GCREC-6," "GCREC-7," "GCREC-8," "GCREC-9," "GCREC-10," "GCREC-11,"
"GCREC-12," "GCREC-13," "GCREC-14," "GCREC-15," and "GCREC-16." In one aspect, the
invention provides an isolated polypeptide selected from the group consisting of a) a polypeptide
comprising an amino acid sequence selected from the group consisting of SEQ ID NO:1-16, b) a
polypeptide comprising a naturally occurring amino acid sequence at least 90% identical to an amino
acid sequence selected from the group consisting of SEQ ID NO:1-16, c) a biologically active
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ
ID NO:1-16, and d) an immunogenic fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-16. In one alternative, the invention provides an
isolated polypeptide comprising the amino acid sequence of SEQ ID NO:1-16.

The invention further provides an isolated polynucleotide encoding a polypeptide selected from
the group consisting of a) a polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO:1-16, b) a polypeptide comprising a naturally occurring amino acid sequence
at least 90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:1-16,
c) a biologically active fragment of a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-16, and d) an immunogenic fragment of a polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO:1-16. In one alternative, the
polynucleotide encodes a polypeptide selected from the group consisting of SEQ ID NO:1-16. In
another alternative, the polynucleotide is selected from the group consisting of SEQ ID NO:17-32.

The invention additionally provides G-protein coupled receptors that are involved in olfactory
and/or taste sensation. The invention further provides polynucleotide sequences that encode said
G-protein coupled receptors.

Additionally, the invention provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-16, b) a polypeptide comprising a naturally occurring amino acid sequence at least
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:1-16, c) a
biologically active fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-16, and d) an immunogenic fragment of a polypeptide having an amino
acid sequence selected from the group consisting of SEQ ID NO:1-16. In one alternative, the
invention provides a cell transformed with the recombinant polynucleotide. In another alternative, the
invention provides a transgenic organism comprising the recombinant polynucleotide.

The invention also provides a method for producing a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-16, b) a polypeptide comprising a naturally occurring amino acid sequence at least
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:1-16, c) a
biologically active fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-16, and d) an immunogenic fragment of a polypeptide having an amino
acid sequence selected from the group consisting of SEQ ID NO:1-16. The method comprises a)
culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a
polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed.

Additionally, the invention provides an isolated antibody which specifically binds to a
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO:1-16, b) a polypeptide comprising a naturally
occurring amino acid sequence at least 90% identical to an amino acid sequence selected from the
group consisting of SEQ ID NO:1-16, c) a biologically active fragment of a polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO:1-16, and d) an immunogenic
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ
ID NO:1-16.

The invention further provides an isolated polynucleotide selected from the group consisting of
a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ
ID NO:17-32, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:17-32,
c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to
the polynucleotide of b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide
comprises at least 60 contiguous nucleotides.

Additionally, the invention provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting of
a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ
ID NO:17-32, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:17-32,
c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to
the polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence
complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to
said target polynucleotide, under conditions whereby a hybridization complex is formed between said
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of
said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe
comprises at least 60 contiguous nucleotides.
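By way of non-limiting illustration, the complementarity requirement on the probe can be sketched in Python. The function names below are hypothetical and do not appear in the specification, and the sketch assumes perfect base pairing over the whole probe, whereas actual hybridization depends on stringency conditions and may tolerate mismatches.

```python
# Illustrative sketch only: exact-match model of a probe binding a target strand.
# All names are hypothetical; real hybridization tolerates some mismatches.

COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def probe_sites(probe: str, target: str, min_len: int = 20) -> list[int]:
    """Return 0-based target positions where the probe is fully complementary.

    min_len reflects the "at least 20 contiguous nucleotides" language;
    pass 60 for the alternative with at least 60 contiguous nucleotides.
    """
    if len(probe) < min_len:
        raise ValueError(f"probe must be at least {min_len} nucleotides long")
    site = reverse_complement(probe).lower()
    target = target.lower()
    positions = []
    start = target.find(site)
    while start != -1:
        positions.append(start)
        start = target.find(site, start + 1)
    return positions
```

A probe designed as the reverse complement of a 24-nucleotide region of a target would be reported at the start position of that region, and an empty list indicates that no perfectly complementary site was found.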

The invention further provides a method for detecting a target polynucleotide in a sample, said
target polynucleotide having a sequence of a polynucleotide selected from the group consisting of a) a
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID
NO:17-32, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:17-32, c) a
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the
polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b)
detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and,
optionally, if present, the amount thereof.

The invention further provides a composition comprising an effective amount of a polypeptide
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO:1-16, b) a polypeptide comprising a naturally occurring
amino acid sequence at least 90% identical to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-16, c) a biologically active fragment of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO:1-16, and d) an immunogenic fragment of
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-16,
and a pharmaceutically acceptable excipient. In one embodiment, the composition comprises an amino
acid sequence selected from the group consisting of SEQ ID NO:1-16. The invention additionally
provides a method of treating a disease or condition associated with decreased expression of
functional GCREC, comprising administering to a patient in need of such treatment the composition.

The invention also provides a method for screening a compound for effectiveness as an
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-16, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected

naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-16, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-16, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-16. The method comprises a) combining the polypeptide with at least one
test compound under conditions permissive for the activity of the polypeptide, b) assessing the activity
of the polypeptide in the presence of the test compound, and c) comparing the activity of the
polypeptide in the presence of the test compound with the activity of the polypeptide in the absence of
the test compound, wherein a change in the activity of the polypeptide in the presence of the test
compound is indicative of a compound that modulates the activity of the polypeptide.

The invention further provides methods of using G-protein coupled receptors of the invention
involved in olfactory and/or taste sensation, biologically active fragments thereof (including those
having receptor activity), and amino acid sequences having at least 90% sequence identity therewith,
to identify compounds that agonize or antagonize the foregoing receptor polypeptides. These
compounds are useful for modulating, blocking and/or mimicking specific tastes and/or odors.

The present invention also relates to the use of olfactory and/or taste receptors of the
invention, biologically active fragments thereof (including those having receptor activity), and
polypeptides having at least 90% sequence identity therewith, in combination with one or more other
olfactory and/or taste receptor polypeptides, to identify a compound or plurality of compounds that
modulate, mimic, and/or block a specific olfactory and/or taste sensation.

The invention also relates to cells that express an olfactory or taste receptor polypeptide of the 
invention, a biologically active fragment thereof (including those having receptor activity), or a 
polypeptide having at least 90% sequence identity therewith, and the use of such cells in cell-based 
screens to identify molecules that modulate, mimic, and/or block specific olfactory or taste sensations. 

Still further, the invention relates to a cell that co-expresses at least one olfactory or taste
G-protein coupled receptor polypeptide of the invention, and a G-protein, and optionally one or more other
olfactory and/or taste G-protein coupled receptor polypeptides, and the use of such a cell in screens to
identify molecules that modulate, mimic, and/or block specific olfactory and/or taste sensations.

The invention further provides a method for screening a compound for effectiveness in
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a
polynucleotide sequence selected from the group consisting of SEQ ID NO:17-32, the method
comprising a) exposing a sample comprising the target polynucleotide to a compound, and b) detecting
altered expression of the target polynucleotide.

The invention further provides a method for assessing toxicity of a test compound, said
method comprising a) treating a biological sample containing nucleic acids with the test compound; b)
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:17-32, ii) a
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a
polynucleotide sequence selected from the group consisting of SEQ ID NO:17-32, iii) a polynucleotide
having a sequence complementary to i), iv) a polynucleotide complementary to the polynucleotide of
ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific
hybridization complex is formed between said probe and a target polynucleotide in the biological
sample, said target polynucleotide selected from the group consisting of i) a polynucleotide comprising
a polynucleotide sequence selected from the group consisting of SEQ ID NO:17-32, ii) a
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a
polynucleotide sequence selected from the group consisting of SEQ ID NO:17-32, iii) a polynucleotide
complementary to the polynucleotide of i), iv) a polynucleotide complementary to the polynucleotide of
ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment
of a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the
amount of hybridization complex; and d) comparing the amount of hybridization complex in the treated
biological sample with the amount of hybridization complex in an untreated biological sample, wherein
a difference in the amount of hybridization complex in the treated biological sample is indicative of
toxicity of the test compound.


BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the present invention. 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank
homolog for polypeptides of the invention. The probability score for the match between each
polypeptide and its GenBank homolog is also shown.

Table 3 shows structural features of polypeptide sequences of the invention, including 
predicted motifs and domains, along with the methods, algorithms, and searchable databases used for 
analysis of the polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide
sequences.

Table 5 shows the representative cDNA library for polynucleotides of the invention.

Table 6 provides an appendix which describes the tissues and vectors used for construction of
the cDNA libraries shown in Table 5.

assembled through the novel reassortment of stable substructures, thus allowing acceleration of the
evolution of new protein functions.

A "fragment" is a unique portion of GCREC or the polynucleotide encoding GCREC which is
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15,
16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected
from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present
embodiments.
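By way of non-limiting illustration, the length constraint in the definition of a fragment (a contiguous portion identical to, but shorter than, the parent sequence, up to the full length minus one residue) can be sketched in Python. The function names are hypothetical, and the sketch does not attempt to model the separate requirement that a fragment be a unique portion of the molecule.

```python
# Illustrative sketch only; names are hypothetical and not part of the specification.
from typing import Iterator


def fragments(parent: str, length: int) -> Iterator[str]:
    """Yield every contiguous fragment of the given length.

    The longest permitted fragment is one residue shorter than the parent.
    """
    if not 1 <= length <= len(parent) - 1:
        raise ValueError("fragment length must be between 1 and len(parent) - 1")
    for start in range(len(parent) - length + 1):
        yield parent[start:start + length]


def is_fragment(candidate: str, parent: str) -> bool:
    """True if candidate is a contiguous portion of parent and strictly shorter."""
    return 0 < len(candidate) < len(parent) and candidate in parent
```

For a five-residue parent, for instance, the longest fragments are the two four-residue windows, and the parent itself is not counted as a fragment.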

A fragment of SEQ ID NO:17-32 comprises a region of unique polynucleotide sequence that
specifically identifies SEQ ID NO:17-32, for example, as distinct from any other sequence in the
genome from which the fragment was obtained. A fragment of SEQ ID NO:17-32 is useful, for
example, in hybridization and amplification technologies and in analogous methods that distinguish SEQ
ID NO:17-32 from related polynucleotide sequences. The precise length of a fragment of SEQ ID
NO:17-32 and the region of SEQ ID NO:17-32 to which the fragment corresponds are routinely
determinable by one of ordinary skill in the art based on the intended purpose for the fragment.

A fragment of SEQ ID NO:1-16 is encoded by a fragment of SEQ ID NO:17-32. A
fragment of SEQ ID NO:1-16 comprises a region of unique amino acid sequence that specifically
identifies SEQ ID NO:1-16. For example, a fragment of SEQ ID NO:1-16 is useful as an
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:1-16.
The precise length of a fragment of SEQ ID NO:1-16 and the region of SEQ ID NO:1-16 to which
the fragment corresponds are routinely determinable by one of ordinary skill in the art based on the
intended purpose for the fragment.

A "full length" polynucleotide sequence is one containing at least a translation initiation codon
(e.g., methionine) followed by an open reading frame and a translation termination codon. A "full
length" polynucleotide sequence encodes a "full length" polypeptide sequence.
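By way of non-limiting illustration, the structural test implied by this definition (an initiation codon, an in-frame open reading frame, and a termination codon) can be sketched in Python. The function name is hypothetical; the sketch assumes a DNA sequence read in the sense orientation, ATG as the only initiation codon, and the standard stop codons TAA, TAG, and TGA.

```python
# Illustrative sketch only; assumes sense-strand DNA and the standard genetic code.
from typing import Optional, Tuple

STOP_CODONS = {"taa", "tag", "tga"}


def find_full_length_orf(seq: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the first ATG-initiated ORF that reaches a stop codon.

    start is the 0-based index of the ATG; end is the index just past the stop
    codon. Returns None if no initiation codon is followed by an in-frame stop.
    """
    seq = seq.lower()
    start = seq.find("atg")
    while start != -1:
        # Walk codon by codon in the reading frame set by this ATG.
        for pos in range(start + 3, len(seq) - 2, 3):
            if seq[pos:pos + 3] in STOP_CODONS:
                return start, pos + 3
        start = seq.find("atg", start + 1)
    return None
```

A sequence for which the function returns None lacks either an initiation codon or an in-frame termination codon and so would not qualify as full length under this simplified reading.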

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two
or more polynucleotide sequences or two or more polypeptide sequences.

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to
the percentage of residue matches between at least two polynucleotide sequences aligned using a

ID NO:9 also contains a rhodopsin family 7-transmembrane receptor domain as determined by
searching the HMM-based PFAM database. (See Table 3.) Data from BLIMPS and BLAST
analyses provide further corroborative evidence that SEQ ID NO:9 is a rhodopsin family G-protein
coupled receptor. In an alternative example, SEQ ID NO:11 is 82% identical to Marmota marmota
olfactory receptor (GenBank ID g5901488) as determined by BLAST. (See Table 2.) The BLAST
probability score is 1.5e-101. SEQ ID NO:11 also contains a rhodopsin family 7-transmembrane
receptor domain as determined by searching the HMM-based PFAM database. (See Table 3.) Data
from BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that
SEQ ID NO:11 is a G-protein coupled receptor. In an alternative example, SEQ ID NO:12 is 64%
identical to Homo sapiens olfactory receptor (GenBank ID g2792018) as determined by BLAST.
(See Table 2.) The BLAST probability score is 8.4e-99. SEQ ID NO:12 also contains a rhodopsin
family 7-transmembrane receptor domain as determined by searching the HMM-based PFAM
database. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide
further corroborative evidence that SEQ ID NO:12 is a G-protein coupled receptor. In an alternative
example, SEQ ID NO:15 is 57% identical to chicken olfactory receptor 4 (GenBank ID g1246534) as
determined by BLAST. (See Table 2.) The BLAST probability score is 1.1e-91. SEQ ID NO:15
also contains a 7-transmembrane receptor (rhodopsin family) domain as determined by searching the
HMM-based PFAM database. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN
analyses provide further corroborative evidence that SEQ ID NO:15 is an olfactory receptor. SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:7, SEQ ID NO:10, SEQ ID NO:13-14, and SEQ ID NO:16 were
analyzed and annotated in a similar manner. The algorithms and parameters for the analysis of SEQ
ID NO:1-16 are described in Table 7.

As shown in Table 4, the full length polynucleotide sequences of the present invention were
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any
combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention.
Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments of
the polynucleotide sequences which are useful, for example, in hybridization or amplification
technologies that identify SEQ ID NO:17-32 or that distinguish between SEQ ID NO:17-32 and
related polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length
polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5')
and stop (3') positions of the cDNA and/or genomic sequences in column 5 relative to their respective

GNN, GFG,   Exon prediction from genomic sequences using, for example,
            GENSCAN (Stanford University, CA, USA) or FGENES
            (Computer Genomics Group, The Sanger Centre, Cambridge, UK)

GBI         Hand-edited analysis of genomic sequences.

FL          Stitched or stretched genomic sequences (see Example V).

INCY        Full length transcript and exon prediction from mapping of EST
            sequences to the genome. Genomic location and EST composition
            data are combined to predict the exons and resulting transcript.



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
column 5 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte 
cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide
sequences which were assembled using Incyte cDNA sequences. The representative cDNA library
is the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences
which were used to assemble and confirm the above polynucleotide sequences. The tissues and
vectors which were used to construct the cDNA libraries shown in Table 5 are described in Table 6.

The invention also encompasses GCREC variants. A preferred GCREC variant is one which
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid
sequence identity to the GCREC amino acid sequence, and which contains at least one functional or
structural characteristic of GCREC.
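By way of non-limiting illustration, the identity thresholds recited above can be checked with a short Python sketch. The function names are hypothetical. The sketch assumes that the two sequences have already been aligned (gaps written as "-") and that identity is counted over ungapped alignment columns; alignment programs differ in how they choose the denominator, so the value computed here is an approximation of what any particular tool reports.

```python
# Illustrative sketch only; assumes pre-aligned sequences with "-" for gaps.

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of matching residues over columns where neither sequence is gapped."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != "-" and y != "-"
    ]
    if not columns:
        return 0.0
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if the aligned pair reaches the given percent identity (e.g. 80, 90, or 95)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Two aligned ten-residue peptides differing at a single position, for example, are 90% identical under this counting and would satisfy the 80% and 90% thresholds but not the 95% threshold.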

The invention also encompasses polynucleotides which encode GCREC. In a particular
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected
from the group consisting of SEQ ID NO:17-32, which encodes GCREC. The polynucleotide
sequences of SEQ ID NO:17-32, as presented in the Sequence Listing, embrace the equivalent RNA
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the
sugar backbone is composed of ribose instead of deoxyribose.
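By way of non-limiting illustration, the base substitution that converts a DNA sequence as written in the Sequence Listing into its equivalent RNA sequence can be expressed in Python. The function name is hypothetical; the change in sugar backbone is a chemical property and is not represented in a text string.

```python
# Illustrative sketch only: thymine (t) is written as uracil (u); other bases are unchanged.

T_TO_U = str.maketrans("tT", "uU")


def rna_equivalent(dna: str) -> str:
    """Return the RNA-equivalent text of a DNA sequence."""
    return dna.translate(T_TO_U)
```

Applied to the text "atgcct", the function yields "augccu".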

The invention also encompasses a variant of a polynucleotide sequence encoding GCREC. In
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide
sequence encoding GCREC. A particular aspect of the invention encompasses a variant of a
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID NO:17-32
which has at least about 70%, or alternatively at least about 85%, or even at least about 95%
polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of SEQ

concentration of DNA. A 5 μl to 10 μl aliquot of the reaction mixture was analyzed by
electrophoresis on a 1% agarose gel to determine which reactions were successful in extending the
sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%)
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector
(Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction
site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37°C in 384-well
plates in LB/2x carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step
5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA
recoveries were reamplified using the same conditions as described above. Samples were diluted with
20% dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM
BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems).
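By way of non-limiting illustration, the cycling parameters listed above can be written as data in Python, which makes the cycle count explicit: steps 2 through 4 are run once and then repeated 29 times, for 30 cycles in total. The names below are hypothetical, the temperatures are taken as 94, 60, and 72 degrees Celsius as listed in the parameters, and the time estimate ignores ramp times between temperatures and the final hold at 4 degrees Celsius.

```python
# Illustrative sketch only; hold times exclude ramping and the final 4 C storage step.
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float   # block temperature in degrees Celsius
    seconds: int    # hold time in seconds


INITIAL_DENATURATION = Step(94, 3 * 60)          # Step 1
CYCLE = (Step(94, 15), Step(60, 60), Step(72, 2 * 60))  # Steps 2-4
CYCLES = 1 + 29                                  # first pass plus 29 repeats (Step 5)
FINAL_EXTENSION = Step(72, 5 * 60)               # Step 6
STORAGE_TEMP_C = 4                               # Step 7, held indefinitely


def program_seconds() -> int:
    """Total programmed hold time, excluding ramping and storage."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return INITIAL_DENATURATION.seconds + CYCLES * per_cycle + FINAL_EXTENSION.seconds
```

Under these assumptions the programmed hold time is 6330 seconds, or about 105.5 minutes.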

In like manner, full length polynucleotide sequences are verified using the above procedure or
are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides
designed for such extension, and an appropriate genomic library.

IX. Labeling and Use of Individual Hybridization Probes

Hybridization probes derived from SEQ ID NO:17-32 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base
pairs, is specifically described, essentially the same procedure is used with larger nucleotide
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 μCi of
[γ-32P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech).
An aliquot containing 10^ counts per minute of the labeled probe is used in a typical membrane-based

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10.

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:11.

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12.

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:13.

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14.

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15.

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16.

78. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:17.

79. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:18.

80. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:19.

81. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:20.

82. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:21.

83. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:22.

84. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:23.

85. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:24.

86. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:25.

87. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:26.

cagttcctct catatctccc catttggaga cagtgagcca ctggaaagtt gtaaaaacaa 1680
aaacagttat ttttgcagtt ttctttcacg catttatagt gctctggata atgccattta 1740
tttttgctga ttacccaact ttcagtattt gctgtgttat catctgtatt tacttatttt 1800
gaatcgtgct taaatcaaat gtaccttcag cacctgcaag tttgcctttt ctttccagga 1860
ggaaaatccc cacgttgctc tccctgggga gtctgagaat tataccagtg ctgtcagaaa 1920
tgtaatcatg ctgtcatttc agagccacag agtatttata aaataaaaac ctttcccacg 1980
gaaaaaaaaa aaa 1993



<210> 22 

<211> 1499 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6157025CB1



<400> 22 

aaatgcatgc agagcatgga aatgccccca gctgccctgc tgttgaaaca gaatcctatt 60
tggaaggcag acatgtggcc catctctgta gccatcactg agaaatctgg attttcaagg 120
gcctttctct ctgttgccca ggctggagtt tagcgactca atcatggctc actgactgca 180
gcatcgacct ccggggctca agtgatcctt tcatctcagc ctcctcagta gctgagacta 240
caggttttcc tgggatcagc tgcactcctt agcaaaagta tattggagaa tcaactgaga 300
aagtaactga gacatttcaa tcatttctag gtgtaaagaa agaccagatc ccaggaaaat 360
attacggtga cttcccaagt atgcctggcc acaatacctc caggaattcc tcttgcgatc 420
ctatagtgac accccactta atcagcctct acttcatagt gcttattggc gggctggtgg 480
gtgtcatttc cattcttttc ctcctggtga aaatgaacac ccggtcagtg accaccatgg 540
cggtcattaa cttggtggtg gtccacagcg tttttctgct gacagtgcca tttcgcttga 600
cctacctcat caagaagact tggatgtttg ggctgccctt ctgcaaattt gtgagtgcca 660
tgctgcacat ccacatgtac ctcacgttcc tattctatgt ggtgatcctg gtcaccagat 720
acctcatctt cttcaagtgc aaagacaaag tggaattcta cagaaaactg catgctgtgg 780
ctgccagtgc tggcatgtgg acgctggtga ttgtcattgt ggtacccctg gttgtctccc 840
ggtatiggaat ccatgaggaa tacaatgagg agcactgttt taaatttcac aaagagcttg 900
cttacacata tgtgaaaatc atcaactata tgatagtcat ttttgtcata gccgttgctg 960
tgattctgtt ggtcttccag gtcttcatca ttatgttgat ggtgcagaag ctacgccckct 1020
ctttactatc ccaccaggag ttctgggctc agctgaaaaa cctatttttt ataggggtca 1080
tccttgtttg tttccttccc taccagttct ttaggatcta ttacttgaat gttgtgacgc 1140
attccaatgc ctgtaacagc aaggttgcat tttataacga aatcttcttg agtgtaacag 1200
caattagctg ctatgatttg cttctctttg tctttggggg aagccattgg tttaagcaaa 1260
agataatggc ttatggaatt gtgtttgtgc cgttagccac aaactacagt attcatattt 1320
gcttccttta tattgggaat aaaaatgggt ataggggagg taagaatggt atttcattac 1380
ttgatcaaag catgccttga tgtaaccaaa aca€iaaggac tataaatgca agagccctca 1440
ttgtagtcct attgggatcc tccatctcga gtgatggcgt acaagacccg tgttgtcgc 1499



<210> 23 
<211> 2455 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 55012817CB1 